Comparing and integrating Tree Adjoining Grammars
نویسندگان
چکیده
Grammars are core elements of many NLP applications. Grammars can be developed in two ways: built by hand or extracted from corpora. In this paper, we compare a handcrajted grammar with a Treebank grammar. We contend that recognizing substructures of the grammars' basic units is necessary tures and semantic information which are rarely represented in the corpora. lt would be ideal if we could combine the strengths of both types of grammar. As a first step towards addressing this issue, in this paper we compare a hand-crafted grammar with a Treebank grammar and propose a way of integrating them to produce new grammars. Two grammars not only because it allows grammars to be compared at a higher level, but also because 2. it provides the building blocks f or consistent and efficient integration of the grammars. The two LTAGs that we compare are the XTAG English grammar (XTAG-Group, 1995) and a grammar extracted from Penn English Treebank. The XTAG grammar has 1004 tree templates.1 The Treebank grammar that we use in this paper is extracted from the Penn English Treebank II (Marcus et al., 1994) using the extraction algorithm described in (Xia, 1999}. The extracted grammar has 3072 templates.
منابع مشابه
PreRkTAG: Prediction of RNA Knotted Structures Using Tree Adjoining Grammars
Background: RNA molecules play many important regulatory, catalytic and structural <span style="font-variant: normal; font-style: norma...
متن کاملThe Relationship Between Tree Adjoining Grammars And Head Grammarst
67 We examine the relationship between the two grammatical formalisms: Tree Adjoining Grammars and Head Grammars. We briefly investigate the weak equivalence of the two formalisms. We then turn to a discussion comparing the linguistic expressiveness of the two formalisms.
متن کاملSemTAG: a platform for specifying Tree Adjoining Grammars and performing TAG-based Semantic Construction
In this paper, we introduce SEMTAG, a free and open software architecture for the development of Tree Adjoining Grammars integrating a compositional semantics. SEMTAG differs from XTAG in two main ways. First, it provides an expressive grammar formalism and compiler for factorising and specifying TAGs. Second, it supports semantic construction.
متن کاملIntegrating a Unification-Based Semantics in a Large Scale Lexicalised Tree Adjoining Grammar for French
In contrast to LFG and HPSG, there is to date no large scale Tree Adjoining Grammar (TAG) equiped with a compositional semantics. In this paper, we report on the integration of a unification-based semantics into a Feature-Based Lexicalised TAG for French consisting of around 6 000 trees. We focus on verb semantics and show how factorisation can be used to support a compact and principled encodi...
متن کاملAutomatically Extracting and Comparing Lexicalized Grammars for Different Languages
In this paper, we present a quantitative comparison between the syntactic structures of three languages: English, Chinese and Korean. This is made possible by first extracting Lexicalized Tree Adjoining Grammars from annotated corpora for each language and then performing the comparison on the extracted grammars. We found that the majority of the core grammar structures for these three language...
متن کامل